PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Thecc1EG029805t4
Common Name TCM_029805
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family HB-other
Protein Properties Length: 1743 aa    MW: 196561 Da    pI: 5.329
Description HB-other family protein
Gene Model
Gene Model ID      Type    Source  Coding Sequence
Thecc1EG029805t4   genome  CGD     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  58.8   9.1e-19  29     84   2          57
                      T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHHC CS
          Homeobox  2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakekk 57
                       kR+  t++qle+Le++++ + yps+++r+ L++klgL++rq ++WF+ rR kekk
  Thecc1EG029805t4 29 PKRQMKTPYQLEALEKAYALETYPSEATRAGLSEKLGLSDRQLQMWFCHRRLKEKK 84
                      69*****************************************************8 PP
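
The Start and End values in the domain table appear to be 1-based, inclusive positions in the protein sequence. As a minimal sketch under that assumption, the Python below slices the homeobox region out of the first 120 residues (copied from the Sequence section) so it can be compared with the Thecc1EG029805t4 row of the alignment above. The names `SEQ_PREFIX` and `slice_1based` are illustrative only and are not part of PlantTFDB.

```python
# First 120 residues of the protein, copied from the Sequence section.
SEQ_PREFIX = (
    "MDPGSEEENNPSKNPNKNVNSSNEGHVKPKRQMKTPYQLEALEKAYALETYPSEATRAGL"
    "SEKLGLSDRQLQMWFCHRRLKEKKETPSKKPRKGAALPPESPIDDLHAGPEPDYGSGSGS"
)


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]


# Start 29 and End 84 are taken from the Signature Domain table.
homeobox = slice_1based(SEQ_PREFIX, 29, 84)
print(homeobox)
print(len(homeobox))
```

The slice is 56 residues long and matches the aligned target row, which supports the 1-based reading for this table.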

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End   InterPro ID  Description
Gene3D           G3DSA:1.10.10.60  1.0E-18   7      85    IPR009057    Homeodomain-like
SuperFamily      SSF46689          1.45E-16  17     85    IPR009057    Homeodomain-like
PROSITE profile  PS50071           16.212    25     85    IPR001356    Homeobox domain
SMART            SM00389           1.7E-16   27     89    IPR001356    Homeobox domain
Pfam             PF00046           1.7E-16   29     84    IPR001356    Homeobox domain
CDD              cd00086           1.73E-13  30     85    No hit       No description
SMART            SM00571           4.2E-22   542    601   IPR018501    DDT domain
PROSITE profile  PS50827           16.486    542    601   IPR018501    DDT domain
Pfam             PF02791           1.0E-16   543    598   IPR018501    DDT domain
Pfam             PF05066           7.4E-15   724    791   IPR007759    HB1/Asxl, restriction endonuclease HTH domain
Pfam             PF15612           3.1E-8    937    981   IPR028942    WHIM1 domain
Pfam             PF15613           1.5E-12   1122   1194  IPR028941    WHIM2 domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0010228  Biological Process  vegetative to reproductive phase transition of meristem
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 1743 aa
MDPGSEEENN PSKNPNKNVN SSNEGHVKPK RQMKTPYQLE ALEKAYALET YPSEATRAGL  60
SEKLGLSDRQ LQMWFCHRRL KEKKETPSKK PRKGAALPPE SPIDDLHAGP EPDYGSGSGS  120
GSSPYTDSRK LGGSSSRGMT EDVPTARRYY ESQQSIMELR AIACVEAQLG EPLRDDGPML  180
GMEFDPLPPD AFGAIPEPQK RSGHPYESKA YERHDGRSSK AAVRALHEYQ FLPEHASLRS  240
DAYGQVTQSH FHESPVDGAR ARATSFVHGE EPLPRVHGIQ GHGSRVRVLP QQDKTGIIPT  300
SSQVADDSLA ERESFTNGRL NTQSIGHPVL GSEDSYVLST GQTLNIDADL RNDRKRKSDE  360
NRIAREVEAH ENRIRKELEK LDLKRRKSEE RMRKEMERHA RERRKEEERL VREKQREEER  420
SQREQRREME RREKFLQKEC LRAEKRRQKE ELRREKEAER RRVAMEKATA RKIAKESMDL  480
IEDEQLELME LAAASKGIPS IIHLDHDSLQ NLESFRDSLS LFPPKSVQLK RPFAIQPWID  540
SEENVGNLLM AWRFLITFAD VLRLWPFTLD EFVQAFHDYD SRLLGEIHVA LLKSIIKDIE  600
DVARTPSTGL GMNQYCAANP EGGHPQIVEG AYSWGFDIRN WQRHLNPLTW PEIFRQLAIS  660
AGLGPQLKKR NAAWTFMGDN DEGKGCEDVV STLRNGSAAE NAFVLMREKG LLLPRRSRHR  720
LTPGTVKFAA FHVLSLEGRE GLTVLELADK IQKSGLRDLT TSKTPEASIS VALTRDAKLF  780
ERIAPSTYCV RPAYRKDPTD AEAILAAARK KIRQFENGFL GGEDADEVER DEVERDEESE  840
CDVDEEPEVD DIATPSNANK DADYPKDEVN TCSGSGKVHV STDALNVPSE FDKDFSSFPP  900
NIMKDANGPS NTGQYVAREE MGTGNPDQQN IEIDESKSGE SWIQGLSEGE YSHLSVEERL  960
NALVALIGIA NEGNSIRAVL EDRLEAANAL KKQMWVEAQL DKSRLKEETM VKMDFPSMMG  1020
IKAEPQLPNS VVEGSQSPFP AAYNKNDEAS PSIPDDQKPL LCSQNVQNDL NSYPAERALV  1080
LQEASMGPDN FSAQQIGHAS KRSRSQLKSY IAHRAEEMYV YRSLPLGQDR RRNRYWQFVA  1140
SASKNDPCSG RIFVELRDGN WRLIDSEEAF DTLLTSLDAR GIRESHLRIM LQKIETSFKE  1200
NVRRNLQCAR AIGRSGSSTE NEVSELDSSP DFPASFDSPS SAICGLNFDA LETLPSFKIQ  1260
LGRNENEKKL ALKRYQDFQR WIWKECYNSS TLCAMKYGKK RCVQLLAVCD VCLRSHIPEE  1320
MHCGYCHQTF GSVNNSFNFS EHEIQCKENR KLDTKDTCTI DYSLPLGISL LKSLCALVEV  1380
SIPPEALESV WIEGRRKMWG RELNASSSVD ELLKILTHLE SAIKRDHLLS NFETTKELLG  1440
SNLQSESDSS VSVLPWIPET TAAVALRLLE LDVSIMCVKQ EKVEPSENKE ARAYIKLPSR  1500
TSLFIKNKEL ELKELDQDEA MKEENFADMS HSKRNSYKRG RGGREQGSGR KWQRRASGSR  1560
YDTGKRSARE KNNLSFRLKQ QGQRTNGRSS GRGRRTVRKR AERRAADNTM VARVADVIKP  1620
KVSDVRDLDE EWRTEKFRVM QMVNPPDSNS AEEESDDNAQ GEGYGQGNWD LDYNGASNGW  1680
NAEAMEASDE DDDAYEDDNG VEQLGEEDSD GDLEISDASD VVANKAGNDD GSDLAVSEDY  1740
SD*
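
The listing above groups residues in blocks of ten, ends each line with a running position count, and closes with a `*` (presumably the stop marker). Before the sequence can be used with other tools, that layout has to be flattened into one string. The following is a minimal Python sketch of one way to do so; `parse_sequence_block` is a hypothetical helper name, and the two-line `example` is copied from the start of the listing.

```python
import re


def parse_sequence_block(block: str) -> str:
    """Flatten a grouped sequence listing into one string of residues.

    Spaces, line breaks and the running position counts are dropped,
    and a trailing '*' is removed.
    """
    residues = re.sub(r"[^A-Za-z*]", "", block)  # keep letters and '*' only
    return residues.rstrip("*").upper()


# First two lines of the listing, copied as they appear in the record.
example = """MDPGSEEENN PSKNPNKNVN SSNEGHVKPK RQMKTPYQLE ALEKAYALET YPSEATRAGL  60
SEKLGLSDRQ LQMWFCHRRL KEKKETPSKK PRKGAALPPE SPIDDLHAGP EPDYGSGSGS  120"""

seq = parse_sequence_block(example)
print(len(seq))
print(seq[:10])
```

Passing the whole listing instead of `example` yields the full protein as a single string, which can then be sliced with the coordinates given elsewhere in the record.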
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    77     84   RRLKEKKE
2    383    404  KRRKSEERMRKEMERHARERRK
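
One way to sanity-check the coordinates in this table is to search for the motif in the protein sequence. The Python sketch below does this for the first NLS, using only the first 120 residues copied from the Sequence section; `SEQ_PREFIX` and `find_motif_1based` are illustrative names, not part of PlantTFDB. For this record the search places RRLKEKKE at 78 to 85 in 1-based, inclusive terms, one higher than the 77 to 84 listed, which suggests (but does not prove) that this table counts positions from zero.

```python
from typing import Optional, Tuple

# First 120 residues of the protein, copied from the Sequence section.
SEQ_PREFIX = (
    "MDPGSEEENNPSKNPNKNVNSSNEGHVKPKRQMKTPYQLEALEKAYALETYPSEATRAGL"
    "SEKLGLSDRQLQMWFCHRRLKEKKETPSKKPRKGAALPPESPIDDLHAGPEPDYGSGSGS"
)


def find_motif_1based(seq: str, motif: str) -> Optional[Tuple[int, int]]:
    """Return the first match as (start, end), 1-based and inclusive."""
    idx = seq.find(motif)  # 0-based index of the first match, or -1
    if idx < 0:
        return None
    return idx + 1, idx + len(motif)


hit = find_motif_1based(SEQ_PREFIX, "RRLKEKKE")
print(hit)
```

The same check can be run for the second NLS once the full sequence has been loaded.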
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  AJ566576  3e-78    AJ566576.1 Theobroma cacao microsatellite, clone mTcCIR255.
Annotation -- Protein
Source     Hit ID              E-value  Description
Refseq     XP_007025540.1      0.0      Homeodomain-like transcriptional regulator, putative isoform 1
Refseq     XP_007025541.1      0.0      Homeodomain-like transcriptional regulator, putative isoform 1
Refseq     XP_007025543.1      0.0      Homeodomain-like transcriptional regulator, putative isoform 1
Swissprot  F4HY56              0.0      RLT1_ARATH; Homeobox-DDT domain protein RLT1
TrEMBL     A0A061GFK3          0.0      A0A061GFK3_THECC; Homeodomain-like transcriptional regulator, putative isoform 1
STRING     POPTR_0011s05660.1  0.0      (Populus trichocarpa)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G28420.1  0.0      homeobox-1
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]